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LOCALIZATION FOR DISCRETE ONE DIMENSIONAL 
RANDOM WORD MODELS 

DAVID DAMANIK, ROBERT SIMS, AND GUNTER STOLZ 


Abstract. We consider Schrodinger operators in t 2 (Z) whose potentials are 
obtained by randomly concatenating words from an underlying set W accord¬ 
ing to some probability measure u on W. Our assumptions allow us to consider 
models with local correlations, such as the random dimer model or, more gener¬ 
ally, random polymer models. We prove spectral localization and, away from a 
finite set of exceptional energies, dynamical localization for such models. These 
results are obtained by employing scattering theoretic methods together with 
Furstenberg’s theorem to verify the necessary input to perform a multiscale 
analysis. 


1. Introduction 

We study one-dimensional discrete random Schrodinger operators H u , where the 
potential is constructed by a random concatenation of finite words, that is, vectors 
in R 7 , 1 < j < to. Our main goal is to extend known results on localization (spectral 
and dynamical) for the Anderson model and the so-called random dimer model to 
this more general class of random operators. The only requirement will be that at 
least two words wi € R 7 " 1 and u>2 G R 72 used in the construction do not commute 
in the sense that (wq, W2) and ( W2 , Wi) are different vectors in R 7l+: ' 2 , which avoids 
periodicity of the random potential. 

Generalizing known results for the Anderson model (see || and JhJ for the most 
general case of an arbitrary non-trivial distribution of the coupling constant), it 
will be shown that H u almost surely has pure point spectrum with exponentially 
decaying eigenfunctions. However, as opposed to the Anderson model, for our 
models the Lyapunov exponent may vanish on a finite set of exceptional energies. 
That this is possible was first observed by physicists in the example of the dimer 
model. It turns out that a proof of dynamical localization requires the exclusion 
of this set of energies. More precisely, we will exclude a somewhat larger, but still 
finite, set of energies, at which Furstenberg’s theorem is not applicable. 

In the case of the random dimer model, where an explicit analysis of the transfer 
matrices allows for an exact determination of the exceptional set, these results 
were proven by de Bievre and Germinet JIJ. To get finiteness of the exceptional 
set in our less explicit situation we employ tools from scattering theory (reflection 
and transmission coefficients) and a basic fact from inverse spectral theory (that 
reflection coefficients for non-trivial scattering cannot vanish identically). These 
methods were developed in our previous work Q to prove localization for continuum 
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Anderson-type models with singular distributions of the couplings, particularly for 
the Bernoulli case. We adapt these methods to the present situation and use them 
to prove positivity of the Lyapunov exponent away from the exceptional set. One 
can then follow the methods used in ||: After deducing Holder continuity of the 
Lyapunov exponent and the integrated density of states, one obtains a Wegner 
bound and an initial length scale estimate. The latter are the ingredients for a 
multiscale analysis, which yields the localization results. 

In Jnj it is shown for a subset of our models (random polymers constructed 
from two words) that a generic type of the appearing critical energies indeed leads 
to a breakdown of dynamical localization. In these cases, one actually gets su- 
perdiffusive transport. The results of [Q and 0 showed that the random dimer 
model provides an example of almost sure coexistence of spectral localization with 
dynamical delocalization. Here we generalize [Q, thus allowing one to construct 
many other examples of this type. We note that some of these examples will give 
rise to non-generic types of exceptional energies, where the dynamical properties 
are not yet known and might well be different from the behavior obtained in |jll| . 

One may also look at these models from the point of view of “subword com¬ 
plexity” of the potential—especially in the case where the potentials take on only 
finitely many values. This point of view has been discussed, for example, in || [tJ. 
The subword complexity function p : N —> N of a given potential V (taking finitely 
many values) is defined as follows: For every n G N, pin) is given by the number 
of distinct subwords of length n of V, where V is regarded as an infinite word. 
It is easy to see that for randomly generated models, the complexity function is a 
non-random quantity, that is, it is the same function for a full measure set of poten¬ 
tials. This complexity measure is popular in many disciplines since it discriminates 
nicely between periodic potentials, aperiodic potentials with long-range order, and 
random potentials. For example, a potential has a bounded complexity function p 
if and only if it is periodic and, on the other extremal end, (Bernoulli-type) An¬ 
derson models have maximal word complexity (since, almost surely, every possible 
word occurs). Heuristically, a reduction of complexity should correspond to a trend 
from localization to delocalization. Finite length (larger than 1) of the building 
blocks in random word models introduces some local correlation into the potential 
and thus reduces complexity compared to the Anderson model. While the trend to 
delocalization is not apparent on the spectral level, it shows through the result of 
(ill on the dynamical level. 

In Section 2 we define random word models, discuss some special cases, and state 
our main results. Section 3 provides a formula which relates the Lyapunov exponent 
of H u to the Lyapunov exponent of products of independent unimodular matrices 
(the word transfer matrices), thus making Furstenberg’s theorem applicable to our 
model. Sections 4 to 6 express the exceptional set of energies in scattering theo¬ 
retic terms, show its finiteness, and prove positivity of the Lyapunov exponent away 
from the excepti ona l set. Here we closely follow arguments from [pjj. The combi¬ 
natorial Lemma 6.1 allows us to apply a fact from inverse spectral theory (which 
enters through Lemma 5.2: Potentials can be reconstructed from m-functions). In 
Section 7 we briefly indicate the main steps in the remaining proof of localization, 
which follows the method from || with minimal changes. Due to the varying length 
of words, the dynamical system underlying our random operators is a generalization 
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of the two-sided shift in infinite product spaces. We include a detailed proof of the 
ergodicity properties of this dynamical system in an appendix. 
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2. Models and Results 


In this section we will present the models we consider and outline the results 
we obtain for them. Basically, we will study discrete Schrodinger operators in one 
dimension whose potentials are obtained by randomly concatenating blocks from 
an underlying set of words. To do so, we will fix a set of words and a probability 
measure on this set. This probability space will then lead to a random family 
of discrete Schrodinger operators which will be the object of study in subsequent 
sections. 

To begin, we define the fundamental set of words from which we will construct our 
operators. Fix two parameters: to € N, the maximum word length, and K G (0, oo), 
the maximum component value of any given word. More explicitly, let 

m 

W:= |J Wj, 

j=i 


where Wj = [— /G, K ] 3 . For j = 1,..., to let Vj be finite Borel measures on Wj. 
Assume that the Vj are normalized such that Xqli v j0^j) = 1- We have then that 
v, the direct sum of the uj, defined by v(U) = u j(Uj), for U = Ujli Uj with 
Uj C Wj, is a probability measure on W. 

We must further assume a non-triviality condition on the space (W,is); essen¬ 
tially, it must contain two elements which “do not commute”: 


(NC) { 


For i = 0,1, there exist Wi € W h , both in supp(z/), 
such that the two vectors: 

(w 0 (l),w 0 (2 ),... ,w 0 (j 0 ),wi(l),wi(2),.. .,wi(ji)) and 
(wi(l),wi(2),. ..,w 1 {j 1 ),w 0 (l),w 0 (2),. ..,w 0 (j 0 )) 
are distinct. 


Here supp(i') is the topological support of v (where W carries the direct sum 
topology of the topologies on Wj). 

If w € W belongs to Wj, we say that w has length j and write |w| = j. Denote 
the expectation of |tu| by 

m 

(L)=Y,Mm- 

3 = 1 

Moreover, set 

Do = W z , take Po = (^) v 

on the a -algebra generated by the cylinder sets in Dq, and 


m 

Q = (J ttj C D 0 x {1,..., to}, where fij = {u> € D 0 : ]w 0 | = j} x (1, 2,..., j}. 
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We also define a probability measure P on 12 as follows: For any Po-measurable 
A C 12o such that there is 1 < j < m with |a>o| = j for every to £ A, we let for 
1 <k<j, 

(2.1) P(Ax{fc}) = ^j-^. 


This determines P uniquely on the u-algebra in 12 generated by sets of the type 
A x {k}, see the Appendix for details. Finally, we define shifts To : 12o —► 12o and 
T : 12 —> 12 by 

(ToUj) n = LOn+i 


and 


( 2 . 2 ) 


T{u,k) = 


(u>, k + 1) if k < | wo | 
(T 0 o;, 1) if k = |w 0 j. 


It is well known that (fi 0 , T 0 ) is ergodic (e.g., m p. 49]). It can also be shown that 
(fi, T) is ergodic. More precisely, we have the following 


Proposition 2.1. Let J := {j : v{Wj) > 0}. 

(a) If J is relatively prime, then (12, T) is strongly mixing. 

(b) If J is not relatively prime, then (12, T) is ergodic, but not weakly mixing. 


As this result seems to be of some interest in its own right, and we could not 
find it in the literature, we include a detailed proof in the appendix. A special case 
of Proposition bl (see example (v) below) was used in [|lTJ without proof. If all the 
words have equal length, then the proof is simple and reduces to a discrete version 
of the suspension procedure described by Kirsch for continuum random operators 
in |p. 

We define a family of discrete Schrodinger operators as follows. For (w, k) £ 12, 
we consider the operator 


{H( u ,k)u)(n) = u(n+l)+u(n - 1) + F( w ,fc) (n)u(n) 
in I 2 (fL), where the potential results from the concatenation of 


. . . ,UJ-i,U) 0 ,CVl,OJ2, ■ ■ ■ 

such that the origin, n = 0, coincides with the Axth position in ujq. The operator 
is Z-ergodic, that is, V/ Uj k)( n ) is P-nreasurable for every n, and 

where U is the shift on I 2 (If). 

We note here that the reason for having to use the probability space 12 rather 
than the more trivial product space 12o is the fact that words have varying length. In 
essence, sequences in 12 have the additional property that the position of their zeroth 
component has also been “randomized” relative to the origin. If all words have the 
same length, that is, is(We) = 1 for some £, then in all our considerations we can 
directly work with 12 0 , (always choose V u (0) = Wo(l) and get Ht oU i = , 

where Ut is the shift by £). In this case (NC) holds whenever W contains at least 
two words. 

Let us discuss a few examples that can be studied within this framework: 
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(i) Standard Anderson model. If we set m = 1, we get f l = f2o> P = Po, and 
the potentials are just given by independent, identically distributed random 
variables. This is the one-dimensional Anderson model whose localization 
properties have been studied in many papers, most generally for arbitrary 
non-trivial distribution by Carmona et al. || and Shubin et al. fl(| . 

(ii) Generalized Anderson model. One can generalize this model by taking 
again a sequence of independent, identically distributed random variables 
with distribution supported in A C [—U,U] and using them as random 
coupling constants of a fixed single site potential. That is, the potential is 
of the form 

V u (n) = ^2 Qk(u)f(n - M), 

kei. 

where the single site potential / : Z —> R. is supported on {0, ...,£ — 
1}. In our notation, this means that W = {Aw : A £ A}, where w = 
(/(0), ...,/(£— 1)) £ KC 

(iii) Discrete displacement model. Fix integers 0 < m < £ and / : Z —> R 
supported in {0,..., m — 1}. Set 

V u {n) = f(n - k£ - d k (u)), 
fee z 

with discrete i.i.d. random variables dk taking values in {0, ... ,£—m}. This 
corresponds to W = {wo, ■ ■ ■ ,W£- m } C R £ with wq = (/(0),...,/(m — 
1), 0,..., 0), ..., w t-m = (0, ...,0,/(0),...,/(m - 1)) and v({ Wj }) = 
P (dk = j )■ (NC) holds whenever / ^ 0. 

(iv) Random dimer model. A special case of the class of examples in (ii) is given 
by the random dimer model, where one sets t = 2, / = X{o,i}- This model 
has been studied by de Bievre and Germinet in [|l). These authors were 
particularly interested in the Bernoulli case, that is, they considered the 
case W = {(A, A), (—A, —A)} for A > 0. This set clearly satisfies (NC). 

(v) Random polymer model Building on jp, Jitomirskaya et al. []ll| studied 
random polymer models, where one considers random concatenations of 
two words. Thus, #(W) = 2. Rather than studying localization properties 
for this model, these authors focused on proving non-trivial lower bounds 
on transport in the presence of so-called critical energies, that is, energies at 
which the transfer matrices associated with the two words are both elliptic 
and commute. This may very well happen even if (NC) holds, for example 
at the energies E = ±A in the dimer model if A < 1. 

Our goal is to prove, in the general context introduced above, spectral localiza¬ 
tion at all energies and dynamical localization away from a finite set of exceptional 
energies. This recovers known results for the examples in (i) and (iv) (cf. @•10) 
and, more importantly, establishes new results for the examples in (ii), (iii) and (v). 
Namely, assuming the non-triviality condition (NC), we will prove the following pair 
of theorems: 

Theorem 2.2 (Exponential Localization). For P -almost every (w,fc) £ Q, the 
operator H(u,,k) has pure point spectrum and all eigenfunctions decay exponentially 
at ±oo. 
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Theorem 2.3 (Strong Dynamical Localization). There exists a finite set M Cl 
such that for every compact interval I C 1 \ M, and every finitely supported ( f > G 
f? 2 (Z), and every p > 0, 


(2.3) 


E | sup II \X\ p e~ itH Pi(H)(j)\ 
{ t >o 


< 00 , 


where Pi is the spectral projection onto I. 


These theorems will be proven by adapting the scattering theoretic approach to 
localization developed in || to the discrete setting. More precisely, we will show 
that, away from a finite set of energies, one can apply Furstenberg’s theorem to 
yield positivity of the Lyapunov exponent and subsequently establish the necessary 
ingredients to start a multiscale analysis. It is by now well known that the above 
theorems “follow from” multiscale analysis. 

Let us briefly compare the results stated above with those found in [^l) , for the 
models in (v). While we have to exclude certain other types of exceptional energies 
as well, the critical energies studied in 0 are a special case of the energies included 
in the set M. Thus the results in [|ll| show that in Theorem T3 th e re striction to 
an interval I outside M is generally necessary. Also, by Theorem [2.2| every two- 
word random polymer model with at least one critical energy in the sense of [ [FT] 
provides an example of a random Schrodinger operator with almost sure coexistence 
of exponential localization and superdiffusive transport. 

As becomes clear from the methods used in this work, in particular Furstenberg’s 
theorem, the number of exceptional energies decreases if the set of words, that is, 
the support of the measure u, increases. One might conjecture that for suitably 
rich word spaces there are no exceptional energies. The following example shows 
that non-discreteness or even connectedness of supp v is not sufficient to guarantee 
this: Choose W = {Aw : A € A} as in example (ii) above with w = (—1, 0,1). At 
E = 0 this yields the word transfer matrix (see Section 3) 


M(Xw , 0) = 


0 -1 


1 


0 


independent of A. This implies that the Lyapunov exponent vanishes at 0 for any 
choice of A. 


3. The Lyapunov Exponent 

In this section we discuss the Lyapunov exponent, which is a quantity that 
measures the growth of transfer matrix norms. This growth is also related to 
growth/decay properties of generalized eigenfunctions associated with the operators 
H(oj,k)- Due to the structure of the underlying word space, it is of some technical 
advantage to define two families of transfer matrices and two Lyapunov exponents. 
We shall, however, show that these two quantities are essentially the same. 

Let us first work in the abstract setting of random products of unimodular ma¬ 
trices. Given w = (w(l), w(2),..., w{j)) G W and z G C, we define 

M(w, z) = T(w(j),z) x • • • x T(w(l),z), 


where for a G R, 


T(a,z) = 


z — a — 1 

1 0 
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For every z £ C, there is a number 70 ( 2 ) £ [0,oo), called the Lyapunov exponent 
(of the family of random products of matrices M(w, z), w £ W) such that we have 

(3.4) 70 (z) = lim In z) X • • • x 2 )|| for P 0 -a.e. w £ SIq. 

N—>oo 

Let us now turn to the transfer matrices associated with the operators H^ uk y 
If (to, k ) £ fi, we define 

){n,z) =T(V(u ! k)(n),z) x x T(P (tJifc) (l), z) 

with the T-matrices from above. Then, for every z £ C, there is a number 7 ( 2 ) £ 
[0, 00 ), called the Lyapunov exponent (associated with the operator family) such 
that we have 

(3.5) 7 (z) = lim — In \\Mr u k){ n i z)|| for P-a.e. (w, fc) £ ft. 

n—>o o 77, v 5 / 

The main purpose of this section is to show that 7 is a fixed multiple of 70 : 
Proposition 3.1. We have for every z £ C, 


(3.6) 


7 o (z) = (L)'y(z). 


Proof. Let fl 0 be the full measure set of those to £ n 0 such that (T4) holds and 
also 

1 fc 


— \uii\ —* (L) as fc ■ 


For w £ r^o, it is easily seen that 


Since 


we conclude from (|bj) that (|3.6|) holds. 


lim — In ||M( W !)(n, z)|| = -yvw-- 
n—>co n (L) 


P{(w, 1) : u £ } — -^r > 0, 


□ 


4. Floquet Solutions Associated With a Periodic Potential 

In this and the following section we use various facts about algebraic functions, 
in particular that they have only finitely many roots. We refer to | fl4| for their 
general theory. 

Let Vper : Z —> K. be p-periodic, that is, V per (n + p) = V pel (n) for every n £ Z. 
We start by collecting some facts from Floquet theory for the periodic operator 
Ho := A + Vper, where (A u)(n) = u(n + 1) + u(n — 1). For any z £ C, let un (-, z) 
and Un(-,z) denote the solutions of 


(4.7) u(n + 1) + u(n — 1) + V pel (n)u(n) = zu(n) 

with ujv(0) = uz>(l) = 1 and ujy(l) = ud(0) = 0. The transfer matrix of ( |4.7| ) 
from 1 to p is the matrix 
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u D (p+l,z) u N (p + l,z) 
u D (p,z) u N (p,z) 


= T(Vp ei (p),z ) x • • • x T(V pei (l),z), 


which is unimodular with polynomial entries in z. The eigenvalues of go{z) are the 
roots of 


(4.8) p 2 - D{z)p + 1 = 0, 

that is, 


(4.9) 


P±{z) 


D(z) ± y/D{z) 2 -4 
2 


where D(z) = Tr[g 0 (-s)]- As roots of ( |4.8| ), the functions p± are algebraic with 
singularities at points with D(z) = ±2. 

The spectrum of Hq , cr(Ho), consists of a finite number of bands which are given 
by the set of real energies A for which |Z)(A)| < 2. Let (a, b) be a stability interval 
of Hq, that is, a maximal interval such that |.D(A)| < 2 for every A G (a, 6). For 
real A G (a, b), one has that 


P±(A) = i(^(A)±*V4-D(A) 2 ), 
P±(A)| = 1, and p~( A) = p+( A). Let 


S := {z G C : z = A + ip where a < A < b and p G R} 
be the vertical strip in the complex plane containing (a, b). For z = A + ip G S, one 
has that the following are equivalent: 

(i) \p±(z)\ = 1, (ii) V = 0, (in) D(z) G (-2, 2). 

The implications (ii) =>• (iii) and (iii) => (i) are clear. To see that (i) => (ii), assume 
that (i) is true for some z = A + ip, where p ^ 0. As both p± have modulus 1, 
all solutions of ( [L7| ) are bounded. By Weyl’s alternative, however, if p ^ 0, then 
there exists a solution in £ 2 near +oo, the Weyl solution, while all other solutions 
are unbounded. This is a contradiction. 

The arguments above imply that p± have analytic continuations to all of S. 
These continuations remain algebraic and the only possible singularities occur at 
a and b. In addition, as p+(z)p-(z) = detgo(~) = 1, we have by continuity of | - | 
that exactly one of p+ and p_ satisfies \p{z)\ < 1 in the upper half-plane. Without 
loss of generality, let us denote by p + the eigenvalue for which |p+(A + ip)\ < 1 for 
all p> 0 and A G (a, b). This corresponds to choosing a branch of the square root 
in (|4.9| ). Then, p_ satisfies |p_(A + ip)\ > 1 for all p > 0 and A G (a, b). Since we 
also have |/?±(A)| = 1 for A G (a, b), it follows from the Schwarz reflection principle 
(apply a fractional linear transformation) that \p+(\ + ip)\ > 1 and |y 0 _(A-F* 77 )| < 1 
for all p < 0 and A G (a, b ). 

For z G S, let v±(z) be the eigenvectors of g 0 (z) corresponding to p±(z) with 
the second component normalized to be one, that is, 


v±(z) = 


(4.10) 


1 

c±(~) 
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One may easily calculate that 


(4.11) 


, n P±(z) - U D (p+l,z) 
C±(Z) = u N (p + l,z) 


Here the denominator cannot vanish for z £ S: Suppose that ujv(p+1, z) = 0. Then 
z is an eigenvalue of the finite Jacobi matrix with diagonal (Jp e r(2),..., V p er (p)) 
and off-diagonal elements 1. Thus z and the solutions ud(-, z) and ujv(-, z) are real. 
It follows from 1 = detgo(z) = ud(p + 1, z)un(j>, z) that Tr <7o(*0 = md(p + 1 ,z) + 
ud(p + 1, z)- 1 > 2. Therefore z is either an endpoint of a stability interval or in a 
gap of H 0 . 

Thus, v± are analytic in S. In particular, as un{p+ 1, -) _1 is a rational function 
with at most a pole at a and b, we have that c±, and therefore v± as well, are 
algebraic functions with, at worst, singularities at a and b. Let <j>±{-,z) be the 
Floquet solutions of ©, that is, the solutions satisfying 


(4.12) 


f <MM) 

V </>±(0 ,z) 


v±(z). 


We first note that X+irj) £ near ±oo if 77 > 0, and <j>±(-, X + ip) £ £ 2 near 
q= 00 if 77 < 0. Thus in this setting, the Floquet solutions are the Weyl solutions. 
Secondly, for fixed n, <t>±(n, ■) are algebraic functions, which are analytic in S with, 
at most, singularities at a and b arising from the singularities in the initial conditions 
v±. Lastly, {(/>+(•, z), </>_(•, z)} are a fundamental system of © for every z £ S, 
as p+{z) ^ P-{z) on S. 


5. Scattering with Respect to a Periodic Background 

Let Vp er be a p-periodic potential. We will insert a local perturbation. That is, 
given real numbers W \,..., W m , we define a potential V by 

( Vper (n) if n < 0 
V(n) = < W n if 1 < n < m 

[ Vpe r (n — m) if n > m + 1. 

Consider the operator H = A + V in <? 2 (Z). Take z £ S, where S is as above, and 
let u+ be the solution of 


(5.13) u(n + 1) + u(n — 1) + V (n)u(n) = zu(n) 

satisfying u + (n) = 4>+{n) for negative n. Since V(n) and V pei (n) coincide for 
n < 0, we get that u+(n) = </>+{n) for n < 1. Moreover, to the right of the 
perturbation, the solution u + can be written as a linear combination of <t>+(- — m+p ) 
and 4>_ (-—m+p), and this identity holds then for n > m since V(n) = V peT (n—m+p) 
for n > m + 1. In other words, there are numbers a(z) and b(z) such that 


(5.14) u+(n,z) 


<f>+ (n, z) for n < 1 

a(z)(f>+(n — m + p, z) + b(z)(f>-(n — m + p, z) for n > m. 


Since <f>± are linearly independent for z £ S, this defines a(z) and b{z) uniquely. 
The numbers t(z) = 1 /a(z) and r(z) = b(z)/a(z) are discrete analogues of the 
classical transmission and reflection coefficients, at least if Iper = 0. In particular, 
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vanishing of 6 is equivalent to vanishing of the reflection coefficient. Thus b and u+ 
take on the role of a (modified) reflection coefficient and Jost solution relative to 
the periodic background U P er, respectively. 

Since for A £ (a, 6), we know <j>-{n , A) — <j>+(n, A) from ( 4.10| ), ( |4.1l|) , and ( 4.12 ), 
by taking u- to be the solution of m with u-(n, A) = </>_(n, A) for n < 1, we 
get that for n > m, 


(5.15) U-(n, A) = a(A)</>_(n — m + p, A) + b{\)(f) + {n — m+p, A). 

Using constancy of the non-zero Wronskian (f> + (n + 1 )^>_(n) — 0_(n + 1 )<j> + (n), we 
arrive at the familiar relation 


(5.16) a (A)| 2 — |6(A)| 2 = 1, 

for A £ (a, 6), corresponding to |r| 2 + |t| 2 = 1. 


Proposition 5.1. a(-) and b(-), defined on S as above, are algebraic functions with 
singularities only possible at the boundaries of stability intervals. 


Proof. Recall that u+ is the solution of (5.13) with 


( u+(l,z) 

V «+(M) 


<MM) 

<MM) 


v+(z). 


Thus ( u+{m + 1, z), u + {m, z)Y is algebraic in S with singularities only possible at 
a and b, that is, the boundaries of the stability interval. As was determined before, 
the same is true for both (4>±(p + \,z),(j>±{p,z)) t . By the definition of a(z) and 
6(z), we have 


(5.17) 


a(z) 

b(z) 

and so we are done. 


fi+(p+l,z) 

<t>+(P,z) 


.(p+l,z) 
i>- ( p, z ) 


u + (m + 1 , z) 
u+(m,z ) 


□ 


Lemma 5.2. //6(A) = 0 for all A € (a, b), then V = V F 


per • 


Proof. Suppose that 6(A) = 0 for all A £ (a, 6), and hence all z £ S by analyticity. 
We know then that for every A £ (a, 6) and rj > 0, the Jost solution u+(n, A + ip) 
is a(A + ir])(/)+(n — m + p, A + ip) (for all n > m), that is, u+ is exponentially 
decaying in this region of the upper half-plane. Thus it+ is the Weyl solution for 
the perturbed equati on ([5.13 ). We may therefore calculate the Weyl-Titclimarsh 
m-function, my, for (5.13|) on the half-line [l,oo), 


my (A + ip) = 


it+(l, A + ip) 
u+(0, A + ip) 
</+( 1 , A + ip) 


<j) + (0,\ + ip) 

= rn Vpet (\ + ip), 

where the latter is the m-function of (O) on [1, oo ) (which is the same as the m- 
function of ( |5.13 ) on [m+1, oo)). As the m-functions are analytic in the entire upper 
half-plane, we conclude that my(z) = my pt!r (z) for all z £ C + . Thus, by standard 
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results from inverse spectral theory (see, e.g., J^j), we conclude that V = V pe r on 
[l,oo). Since they coincide on (—oo,0] by definition, we get V = tp e r- □ 

Thus, if we restrict our attention to the case V ^ V(, er , we know that {A £ (a, b ) : 
b( A) = 0 } is finite. 

Now we consider a gap. Take a such that —oo < a < a < b and (a, a) is a 
maximal, non-trivial gap in the spectrum of Hq (if a = inf cr(Ho), then a = —oo). 
We note that there is also a gap (b, oo) where b = sup(cr(Ho)). The analysis of this 
gap is identical to the case a = — oo below, excepting that we analytically continue 
to the right rather than the left. Consider the following split strip: 


S' := {z = A + ip : a < A < b, 77 £ R} \ [a, b). 

For i = 1,2, let Pi(z) be the branches of ( |4.8| ) with \pi(z)\ < 1 and |p 2 (z)| > 1 for 
all z £ S', which are well defined since [a, b) is excluded. We first note that p\ = p + 
on the upper half of S, but pi = p_ on the lower half of S. Secondly, as before, it 
can be seen that pi are algebraic in S'. They have, at most, singularities at a, a, 
and b , and therefore they may be continued analytically across (a, b). In particular, 
Pi is the analytic continuation of pj , where i,j £ (1, 2} with i ^ j. 

For z £ S', choose ei genve ctors 17 ( 2 ) = ( Ci(z),iy of go(z) corresponding 
to pi(z), analogously to ( 4.10 ). Taking <pi to be the solutions of (4.7) with 
z), 4>i(0, z)Y = Vi(z) for z £ S', we see again that <j>\ (-,z) (resp., <j> 2 (-,z)) 
is in £ 2 near +00 (res p., — 00 ), that is, they are the Weyl solutions. Set Ui to be 
the Jost solutions of ( [5.13| ) satisfying 


(5.18) Ui(n,z) 


<t>i{n, z) n< 1 

ai(z)4>i(n — m + p, z) + bi(z)4>j(n — m + p, z) n > m 


for z £ S' and the same i,j convention used above. As in dH) above, one sees 
that a\(z) and bi(z) are algebraic in S'. In fact, in the upper half S+ of S, they 
coincide with a(z) and b(z), since for z £ S+, we have that v+(z) and vi(z) coincide. 
Thus 01 ( 2 ) and b\(z) are analytic continuations of the restrictions of a(z) and b(z) 
to S+. Similarly, it is seen that 02 ( 2 ) and 62 ( 2 ) are analytic continuations of the 
restrictions of a ( 2 ) and 6 ( 2 ) to the lower half S- of S. Thus, 01 ( 2 ), 61 ( 2 ), 02 ( 2 ), 
and 62 ( 2 ) are algebraic and have, at most, singularities at a, a, and b. In particular, 
when V ^ V^er, they cannot vanish identically, and hence the set 


(5.19) {A e (a, b) : oi(A) 6 i(A)a 2 (A) 6 2 (A) = 0 } 

is finite, even in the case a = — 00 . 


6. Furstenberg’s Theorem and Positivity of the Lyapunov Exponent 


To prove positivity of 70 (A) for A £ R, away from a finite set, we will investigate 
properties of the transfer matrices. We assumed in (NC) that there are two words 
wq, w\ in supp(i') which do not commute, and we will therefore work with the 
following pair of transfer matrices: the “free” matrix 


go (A) = M(wq, A), 


corresponding to the periodic problem (4.7) and the matrix 


5 i(A) = M(w\, A) 
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descri bing the local perturbation, corresponding to the perturbed difference equa¬ 
tion ( 5.13 ). In correspondence with the previous sections, we take Hq to be the 
operator with p-periodic potential Tper, where p = |wo|, such that V pel (n) = wo(n ), 
1 < n < p, and H to be the operator generated by the perturbed potential V 
results from IA per by inserting the m numbers W\,..., W rn , where m = |iui| and 
W n = w\(n ), 1 < n < m. Of course, this convention is arbitrary and the roles of 
the two transfer matrices could be interchanged. 


Lemma 6.1. Assume that (NC) holds with non-commuting words wq,wi. Then, 
we have for the potentials Tper, V defined above, 


( 6 . 20 ) Vr ^ V. 

Proof. Assume that ( 6.20| ) fails. Then, we get that wq is a power, that is, there is 
some v, which itself is not a power, and some s > 2 in N such that wo = v s = vv ... v. 
We have then that 


(6.21) wivvv ... = vvvv ... 

by considering the restrictions of Vp er and V to N as one-sided infinte words. If the 
length of w\ is not an integer multiple of the length of v, we can again argue that 
v must be a power, and hence get a contradiction. Thus, the length of w i is an 
integer multiple of the length of v. By inspection of (6.21), this implies that w±, 
too, is a power of v. Thus w\ = V s and hence we get a contradiction to (NC) since 

WqWi = V s+s = W\Wq. □ 


Set G(A) to be the closed subgroup of SL(2,R) generated by {M(w, A) : w £ 
supp(^)}. Let P(M 2 ) be the projective space, that is, the set of the directions in 
]R 2 and v be the direction of v £ R 2 \ {0}. Note that SL(2,R) acts on P(R 2 ) by 
gv = ~gv. We say that G C SL(2, R) is strongly irreducible if and only if there is no 
finite G-invariant set in P(R 2 ). 

It follows from Furstenberg’s theorem [Q that 70 (A) > 0 if G(A) is non-compact 
and strongly irreducible. 

The main result of this section is the following: 


Theorem 6.2. Assume that (NC) holds. Then there exists a finite set M C R, such 
that G(A) is non-compact and strongly irreducible for all A £ R \ M. In particular, 
70 (A) > 0 for all A € R \ M. 


Proof. The proof of Theorem T2 is analogous to the proof of Theorem 2.3 in [j| so 
we only sketch the argument briefly. 

When non-compact, it is known that the group G is strongly irreducible if and 
only if for each v £ 


(6.22) #{gv : g £ G} > 3; 

see [Q. Note that both non-compactness ofG(A) and ( |6.22 ) are properties which are 
preserved if the set of underlying words is enlarged. Thus one can assume without 
loss of generality that G(A) is generated by 50 (A) and 51 (A). 

By an analysis which is almost identical to the one in ](]], non-compactness and 
( 3.22|) can be shown away from the roots of b (finitely many in each of the finitely 
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many stability intervals), the roots o f D (one per stability interval), the endpoints 
of stability intervals, and the set in ( 5.19 ); and hence away from a finite set. 

For A in a gap, non-compactness follows since go(X) has an eigenvalue of modulus 
larger than 1. The set ( 5.19| ) is avoided to verify ( |6.22| ). 

If A is in a stability interval, then the crucial link to scattering coefficients is 
given by the fact that G(A) is conjugate to the group G(A) generated by the two 
matrices 


<7o (A) 


cos uj sin uj 
— sin uj cos uj 


, s(A) 


Re[a(A) + 6 (A)] Im[a(A) + 6 (A)] A 
—Im[a(A) — 6 (A)] Re[a(A) — 6 (A)] ) ’ 


where uj = uj(X) £ ( 0 , 7 r) such that p + ( A) = e IW ; see || Lemma 2.4]. This allows 
for a very explicit analysis of the group G(A) in terms of a(A), 6 (A) and uj. From 
6 (A) 7 ^ 0 one can conclude non-compactness, and D( A) 7 ^ 0 means uj 7 ^ 7 r/ 2 , 
implying ( |6.22 ). We refer to |J for details. □ 


7. Proof of Localization via Multiscale Analysis 


The proofs of Theorems 2.2 and 2.3 involve the technical machinery of multiscale 
analysis, including the verification of the necessary ingredients which are known to 
make this whole apparatus work. Still, given the results which were established 
in the previous sections, we can conclude with a brief sketch of the remaining 
arguments. This is due to the fact that, from here on, the strategy which was 
developed by Carmona et al. [|j for the special case of the Anderson model can 
also be followed for the more general random word models studied here. The main 
point is that one has to stay away from the finite set of exceptional energies. 

First of all, for energy intervals in which positivity of 7 can be established through 
Furstenberg’s theorem, that is, where the group G(A) is non-compact and strongly 
irreducible, an analysis of the A-dependence can be performed which yields Holder 
continuity of 70 , and thus 7 . Details on this, which other than smooth dependence 
of the transfer matrices on A uses only general facts from the theory of products of 
independent SL{2, R)-matrices, can be found in || and [Q; see also Q, where it is 
shown that this analysis also applies to continuum models. 

Through the well known connection of 7 and the integrated density of states 
(IDS) of ergodic discrete Schrodinger operators provided by the Thouless formula, 
one then obtains Holder continuity of the IDS. 

Combined with the positivity of 7 , the latter is the basis for proving a Wegner 
estimate as well as an initial length scale estimate , both away from exceptional en¬ 
ergies, suitable for starting the multiscale analysis. The details of this are straight¬ 
forward extensions of the arguments provided in || for the Anderson model. 

That multiscale analysis, given Wegner and init ial length scale estimates, proves 
exponential localization as claimed in Theorem 2.2, has been known since the 1980’s 
and is stated in || together with all the fundamental references. Improved ver¬ 
sions, which show that strong dynamical localization as in Theorem |2.3| is obtained 
through multiscale analysis, were recently provided in || and While these 

papers are written for continuum operators, they both note that their results apply 
to lattice models as well. 
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Appendix 


The aim of this appendix is to prove Proposition 2.1. We will start by pro¬ 
viding a semi-algebra which generates the P-measurable sets and justifying that 
P is uniquely determined by ( [2.l| ). We will then check that T, given by (2.2), is 
measure-preserving and show the asserted ergodicity properties by working on the 
semi-algebra; see Theorems l.f and 1.17 of |l8|]. The semi-algebra we construct con¬ 
sists of suitable cylinder sets. With these explicit sets we are able to demonstrate 
that the main idea of the classical proof of ergodicity of shifts, namely that two 
given cylinder sets become independent if their non-trivial components are shifted 
into disjoint regions, still holds. Due to the varying word length, the details of this 
argument become more cumbersome, which is our reason for including them here. 

Let Bj denote the Borel sets in Wj, j = 1,... ,m, and B the Borel sets in W 
(i.e., the i/-measurable sets). The set of all measurable cylinders of the form 


-(«+!) 


(A.l) 


]~j W x Ai X A 0 x Ai X I V. 


i= 1 


i=n -\-1 


where n £ N, A 0 £ Bj , and Aj £ B , for 1 < |i| < n, is a semi-algebra in {u> £ f2 0 : 
l w o| = j }; denote it by Sj. Here the ~ symbol denotes the zero position in W z . It 
follows that 


(A.2) 


l=i 


is a semi-algebra in = (J™_ 1 Qj, where Clj = {u £ do : |wo| = j} x {1,..., j}. 

We define a function P : S —* by P(A x {fc}) = ¥q(A)/(L) whenever A £ Sj 7 
and 1 < k < j. P is countably additive since Pq is countably additive and 


m J 


(A.3) 


EE P n WxWj xJlwj x{k}\= 1 . 

j —1 k— 1 \ \i =—oo 2=1 / / 


Thus P can be uniquely extended to a probability measure P on the er-algebra T 
in generated by S. By construction, we have that P(A x {k}) = Po (A)/(L) for 
every Po-measurable A C {u> £ Do : |u>o| = j} and 1 < k < j. Define T : H —> fl by 

& 

Lemma A.l. T is a measure-preserving bijection. 

Proof. T is a bijection with 


(A.4) 


T~\uj,k) = 


(v,k- 1) 


if k > 1 


(T 0 i w, |w_i|) if fe = 1. 


To prove that T is measure-preserving it suffices to show that P(T 1 M) = P(M) 
for all M £ S. Let M = Ax {fc}, A £ Sj, 1 < k < j. Thus P(M) = Po (A)/{L). 

If k > 1, then T~ X M = A x {k - 1} and P (T~ 1 M) = P(M). 

If k = 1, represent A in the form (A.l) and decompose 


• ' I - U O, C i eB r 

i=i 
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Then, T *(A x {1}) = U^Li( A i x {j}), where 


—n — 1 n+1 oo 

A, = n w x n a ‘- 1 x Cj x n ax n w 

i =—oo i=-n-\-1 i—1 i=n-\-2 

Since Po is additive and To is measure-preserving, we get 

P(T~\A x {!})) = f>(A, x O'}) = E ^77^ = W7W = P (^ x «)- 


. 7=1 


. 7=1 


(L) 


(L) 


concluding the proof. 


□ 


Proof of Proposition \2. (a) We assume that J is relatively prime. In order to 

show that T is strongly mixing, we need to prove that 

(A.5) lim P(T-*(4 k A ) n (B, k B )) = P(A, k A ) ■ P (B, k B ) = ’ Po(g) , 

l —>oo (T )' 2 

Here {A, k A ) C H and (B,k B ) C H are arbitrary sets of the form 

(A. 6 ) 

where n € N, 6j e , Dj € , j = —n, ■ ■ ■ ,n, 1 < k A < ao, and 1 < k B < bo- 

The sets (A, k A ) with A as in (El are also a generating semi-algebr a S f or T 
(take finite disjoint unions to get 5). Thus strong mixing follows from (A.5) and 
(T| Theorem 1.17 (iii)]. 

In order to calculate the left-hand side of (A.5), we will have to write T~ e (A, k A ) 
as a disjoint union of sets ( C,k ) with C of type (A.l). This is simple for £ = 
£o + i + • • • + a-n — 1 , where 


A = ■ ■ ■ X W X C-n X-"XCoX"-xC B xWX"-, 
B = • • • x W x T>_ n x • • • x D 0 x • • • x D n x W x • • • , 


(A.7) 

and 

(A.8) 


T- £ °(A,fc A ) = (Ao,l), 


An = • • • X W X X • • • X X W X 


For ( > 4, we have T~ e {A, k A ) = T _ (^°)(A 0 ,1). Splitting sufficiently many of the 
W-factors in (A. 8 ) into their disjoint components VV ? , we see that T _ ^ _ ^ 0 ^(A 0 ,1) 
is a disjoint union of sets of the form (A(ji,..., j r ), k) where 

(A.9) A(ji,...,j r ):=.-xWxW jl x-xlV jr xC_ T 1 x-xC„xWx-. 

We only need to determine the (A(j i,... ,j r ), k) in T _ ^ _ A>)(a 0 , 1 ) with fc = k B , 
since otherwise the intersection with ( B,k B ) is empty. Comparing (A. 8 ) and 
(A.9) and counting the length of the words shows that (A(ji, ...,j r ), k B ) C 
7 i -(c-^o)(A 0 , 1 ) if and only if r and ji, ■ ■ ■ ,j r are such that 

(A.10) k B < ji < TO, 1 < j 2 , ■ • •, > <m, ji + ...+jr = k B + (£- £ 0 ) - 1- 

This shows that 

(A. 11 ) T~\A, k A ) n (B, k B ) = lJ(A(j 1 ,..., j r ) n B, k B ), 
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where the disjoint union is taken over all r,j 1 ,... ,j r as in (A. 10). If £ is sufficiently 
large, then we have for all the sets on the right-hand side of (A.11), that 


A(ji, . . . , jr) n B = ■ ■ ■ X W X D_ n X • • • X £>_ 1 X (L>0 n W h ) x • • • 

• • • x {D n n W Jn+1 ) x Wj n+2 x • • • x Wj r x C- n x • • • x C n x W x 


Hence, 


(T~\A, k A ) n (B, k B )) = •••,>) n B)) 


E p o ( B n (• ■ ■ X w x x ■ ■ • x W jr x W x • • • )) • P 0 (H). 


(L ) 2 

Taking into account further that 

B n (• • • x w X w n x ■ ■ ■ x Wj r x w x ...) ^ 0 
only if (bo, • • •, b„) = (ji, ..., jn+i), in which case 

P 0 (b n (• • • x W x W h x • • • x W jn+1 x W x ■ ■ •)) = P 0 (B), 
we conclude 

(A.12) P (T-‘(A, k A ) n (B, k B )) = E ^ +2 • ■ ■ ■ ■ Po(A)P 0 (B), 

where the sum runs over 1 < j n+2 ,..., j r < m,j n+ 2 + ... +j r = £ — Here we 

have s et Ia .b := 4) + ^0 + • • • + b n + 1 — k B and Vj := z'(Wj). Thus, after an index 
shift, ( 0 ) becomes equivalent to 

( A - 13 ) E 1 1 


1 < 3 1, 

jl + • • • + js = t 


(L) ET=i3 


Since Y^jLi v :i = 1 and J is relatively prime, this is exactly the result proven in 
Lemma [A.2| (a) below. This finishes the proof of part (a). 

(b) If J is not relatively prime, then by Lemma |A.2| (b) below, the left-hand side 
of ( A.13| ) converges to (L } _1 in Cesaro mean. Our previous arguments then yield 
that 

d 

’ (: T-\A , k A ) n (B, k B )) = P (A, k A ) • P (B, k B ), 


lim ^E 1 

d— >00 d ■*"— J 
1 = 1 


for all (A,k A ) and (B,k B ) in S. By [Jl8|, Theorem 1.17 (i)] this implies ergodicity 
of T. 

To complete the proof of part (b), it remains to check that in the latter case, 
T is not weakly mixing. To this end, note that by the non-triviality condition 
(NC), there exist j € {1,..., to} and C G Bj such that 0 < Vj (C) < 1. Choose 
A := • • • xW x C xW ■ • •, that is, A x { 1 } € S. If £ is not a multiple of the greatest 
common divisor D > 1 of J, then by considerations as above one sees that 

p (r-V, 1 ) n (A, 1)) =0. 

Since P(A, 1) = Vj(C)/(L ), this implies that 

liminf \ 1 1 nr-‘(A, i) n (A D) - P(A D 2 | > } (^f) > o. 
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Thus T is not weakly mixing; see [jl 8 [ Theorem 1.17 (ii)]. 


□ 


In the above we have used a combinatorial lemma, which we state and prove 
below. Let m G N be fixed. For any £ G N, let the set of unordered partitions, or 
compositions, of t by natural numbers less than or equal to rn be denoted by 

P{£,m) := | (ji, j s ) G {1, •• -,m} s : s G N and Y j r = £ 

l r= 1 

and set j s := (ji,... ,j a ) G P(£,m). 


Lemma A.2. For j = 1,..., m, suppose 0 < Vj < 1 are given with X^j=i v :i = 1 ■ 
(a) If J := {j : Uj > 0} is relatively prime, then 


(A.14) 


lim 


E ■ 

j B £P(£,m) 


n Tj 2 



1 

1 j ■ V 3 ' 


(b) If J is not relatively prime, then 


(A.15) 


1 


d 





1 

1 j ■ V 3 ' 


Proof. For each feN, define 

(A.16) A t := Y v h • v h ■ ■ • • • v :is > 

j B eP(t,m) 

and for any z € C with \z\ < 1, set 

OO 

(A.17) fa) :=Y A ^- 

l=\ 

Clearly, 

OO 

fa) = E E "h ■ ■ ■ ■ ■ ■ Vj,** 

oo m m 

= E E ■ • ■ E ■ u 32 ■ ■ ■ ■ ■ 

»=iji=i j»= i 

oo (m \ S 

= E E^ 

s =i \j=i / 

E 777 . A 

j=l V 3 Z3 

Take Aq := 0 and consider 

fa) ■= (! - z)fa). 

It is clear that both <f> and <p are analytic in \z\ < 1. 

Observe that the set J := {j : Vj > 0} is relatively prime if and only if the 
equation Y^jLi v j z3 = 1 has exactly one solution (z = 1 ) on the unit circle. 
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We may conclude that if J is relatively prime, then <f> is analytic in a neighbor¬ 
hood of {z G C : \z\ < 1}, and 

lim A e = 0(1) = 1 . -. 

Ej=l J ' U 3 

This completes the proof of (a). 

If the set J is not relatively prime, then let D be the greatest common divisor 
of J, that is, suppose each j £ J can be written as j = Dkj , and let K := {kj : j = 
Dkj G J}. 

Rescale the Vj as follows: for each 1 < j < to, define 


r o if j i. k 

\ v Dj if j G K. 


Note that the set J := {j : kj > 0} is relatively prime by construction. Taking 


:= 


Vi, ‘TOO • ••• • V.i. 


one sees that 


Thus 


A( = 


0 if 


D 


N 


Aj if£=jeN. 

1 . d , 1 

lim -YAt= lim - V' A Dj . 

ft UYl n • ^ H KY1 ff • ^ 


j£N:Dj<d 


Letting m = d/D, we have that 


.j d .j m 

lim — N An = — lim — y A, = 

d— n ' Y) m —>oo ?77, ' 


£=1 


J=1 


V 7 
A—/7 — 1 J 


which completes our proof. 


□ 
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